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ABSTRACT 

The luminosity functions (LFs) of star cluster systems (i.e. the number of clusters 
per luminosity interval) are vital diagnostics to probe the conditions of star cluster 
formation. Early studies have revealed a clear dichotomy between old globular clusters 
and young clusters, with the former characterised by Gaussian-shaped LFs, and the 
latter following a power law. Recently, this view was challenged by studies of galaxy 
merger remnants and post-starburst galaxies. In this paper we re-evaluate the young 
(< few hundreds of Myrs, with the majority < few tens of Myrs) star cluster system in 
the ongoing spiral-spiral major merger system NGC 4038/39, the "Antennae" galaxies. 
The Antennae galaxies represent a very active and complex star- forming environment, 
which hampers cluster selection and photometry as well as the determination of ob- 
servational completeness fractions. A main issue of concern is the large number of 
bright young stars contained in most earlier studies, which we carefully exclude from 
our cluster sample by accurately determining the source sizes. The resulting LFs are 
fitted both with Gaussian and with power-law distributions, taking into account both 
the observational completeness fractions and photometric errors, and compared using 
a likelihood ratio test. The likelihood ratio results are rigidly evaluated using Monte 
Carlo simulations. We perform a number of additional tests, e.g. with subsets of the 
total sample, all confirming our main result: that a Gaussian distribution fits the ob- 
served LFs of clusters in this preferentially very young cluster system significantly 
better than a power-law distribution, at a (statistical) error probability of less than 
0.5 per cent. 

Key words: globular clusters: general - open clusters and associations: general - 
galaxies: star clusters - galaxies: evolution - methods: data analysis 



1 INTRODUCTION 

1.1 Star Clusters: evolution and interpretation 

Star clusters (SCs) form nearly instantaneously through the 
collapse of giant molecular gas clouds (GMCs). Hence all 
stars within a SC are approximately coeval, and represent 
a simple stellar population (SSP). A small number of pa- 
rameters, in particular their initial chemical composition 
and initial stellar mass function (IMF), are enough to de- 
scribe their colour and luminosity evolution on the basis of 
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a given set o f stella r evolutionary tracks or isochrones (e.g 



Schulz et al l 12001 lAnders fc Fntze-v. Alvenslebenl [2003, 



Bruzual fc Chariot! 12003). Therefore, observed spectropho- 



tometry properties of SCs are relatively easy and straight- 
forward to interpret. 

SC formation is a major mode of all star formation, 
and possibly even the dominant mode in str ong starbursts 
triggered in gas-ric h galaxy mergers (e.g., iMeurerl 1 19951 , 



Ide Griis et ai] [003a) 



SCs are excellent tracers of their parent galaxy's star- 
forming properties. Radial age gradients in SC systems and 
age differences between SF regions tell us about the dynam- 
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ical evolution of a starburst (|de Griis et al.ll200ll ). This is 
owing to the fact that SCs are not only relatively easy to 
model but also much easier to analyse than the integrated 
light of the galaxy, because SCs can be studied individu- 
ally. Even without individual SC spectroscopy, multi-band 
imaging in at least four suitable passbands allows us to de- 
termine the age, m etallicity, extinction, and mass of a SC 
l| Anders et al.llo04al lbh. The age and metallicity distributions 
of a SC system directly reveal the SF and chemical enrich- 
ment histories of its parent galaxy, to much higher precision 
than studies of the galaxy's integrated light, since the latter 
is always dominated by the last major epoch of SF (see e.g. 
iFritze-v. Alvenslebenll2004 ). The long-lived SCs, and the old 
globular cl usters (GCs) in pa rticular, hold a key role in this 
respect Ccf. lWest et~al]|2004 ). 



1.2 Star Cluster Systems 

The most commonly used diagnostics to explore the proper- 
ties and evolution of entire SC systems are their luminosity 
and mass function^] (LFs, MFs). 

For old GC systems in the local Universe, both LFs 
and MFs are of log- normal ("Gaussian") shape, with 
ver y similar parameters a mong a wid e variety of galax- 
ies (jAshman fc ZepJ Il998l . iHarrisl Il99ll ; min or trends with 
meta l content can easily be accounted for, cf. lAshman et al.l 
Il995l ). with a Gaussian peak (or "turnover") magnitude at 
Mv = —7.3 mag and a Gaussian FWHM of 2.8 mag, corre- 
sponding to a Gaussian a = 1.2 mag. This universal turnover 
of GC LFs is often used as "secondary" long-range extra- 
galactic distance indicator, and hence for determinations 
of the Hubble constant and, as a consequence, of the ex- 
pansion rate of the Universe ([Sandage fc Tammannl 1 19951. 
lForbedll99d . lKavelaars et al.ll2000l ) 

Young star clusters (YSCs, often referred to as "open" 
or "populous" clusters) in the local Universe seem to be 
of a different character than the ubiquitous GCs. In the 
Milky Way, young clusters are mainly sparse, low-mass ob- 
jects (~ 10 3 — 10 4 Mo), with low concentration and life- 
times of order 10 8 yr, that will soon dissolve into the field 
star population. The Galact i c ope n cluster compilation of 
Ivan den Bergh fc Lafontaind (Il984l ) suggests a LF of power 



law-type with slope a < 0.5 (but it might be steeper due 
to incompleteness effects). The only local galaxies with a 
large number of young clusters with masses > 10 4 M 
(up to masses < 3 x 10 5 M p ) are the Magellanic Clouds . 
Wor k bvlElson fe Fall il 19851 ) , lElmegreen fc Efremovl (Il9971 l 
and iHunter et alj (|2o61 i" suggested a luminosity spectrum 
with a power law of the form Nysc(L)(1L oc L a dL, where 
NYSc(L)dL is the number of young star clusters (YSCs) 



1 Following iMcLauehlin fc Pudritd lll996l ) and 

iParmentier fc Gilmorj 1120061 we denote a luminosity spec- 
trum (LS) the number of objects per linear luminosity interval 
AN/dL, while we refer to the luminosity function (LF) to de- 
scribe the number of objects per logarithmic interval dN/dlogL. 
Magnitudes (mag) are logarithmically related to luminosities (L) 
logL oc mag. If a LS can be described by a power law L~ OLLS , 
the slope of the corresponding LF is a^p = a^g — 1 and if 
expressed in magnitudes ce L p> = 0.4 X («ls — 1) = 0.4 X a^p. 
In this paper we discuss LFs expressed in magnitudes. 
Mass functions are defined similarly. 



with luminosities between L and L + dL, with a ~ —2 (cor- 
responding to a LF wit h a sa —1). Recently, this power-law 
result was challenged bv lde Griis fc Anders! (2006). who find 
significant deviations from a power law for clusters in the 
Large Magellanic Cloud (LMC) of ages younger than ~ 100 
Myr and masses below ~ 3 X 10 3 M Q . 

Rich systems of YSCs are routinely observed in star- 
burst galaxies, in particular in strong starbursts trig- 
gered by gas-rich ga l axy interactions and mergers (e.g., 



Whitmore et al.lll993l. ISchweizer fc Seitzerl[l998l.JZepf et alj 

ajj 



19991 . IZhang et all l200ll . Ide Griis et alj I200U l003al lbf) 



Merger remnants with post-starburst signatures also re- 
veal SC systems with ages of up to 1-3 Gyr (e.g., 
Ide Griis et aill003d . iGoudfrooii et al.ll2004| j, indicating that 
at least some fraction of the SCs formed during a merger- 
induced burst survived much longer than most of the open 
clusters in the Milky Way. The LFs of these YSC sys- 
tem s are usually adopt ed to be of power law-type (see 
e.g. EI Griis et al.ll003bl for a rec ent compil ation), but see 
IFritze-v. Alvenslebenl (Il998l.ll999h . as well as ide Griis et all 
(|003ch and lGoudfrooii et al.l (|2004T ) for Gaussian-shaped LFs 
of young and intermediate-age cluster systems, respectively. 

The lower maximum masses of the YSCs in the Milky 
Way as compared to YSCs in the LMC and in star- 
burst /merging galaxies could be (partially) understood by 
the (purely statistic al) size-of-sample effect, studied e.g. by 
IHunter et all (2003): In the latter galaxies the total number 
of YSCs is larger than in the Milky Way, therefore the MFs 
are sampled up to higher masses. 

It has hitherto remained unclear whether the differ- 
ence in shape between the power-law LFs of young SCs 
(but seelFritze-v. Alvenslebenlll998l.ll999l.lCresci et alj|2005l . 
Ide Griis et al.ll003d and lGoudfrooii et al.ll2004l for the Gaus- 
sian LFs in the Antennae galaxies [all clusters], NGC 5253 
[clusters older than 10 Myr], M82B [contains clusters at 
roughly 1 Gyr] and NGC 1316 [ contains clusters at roug hly 
3 Gyr], and the new results by Ide Griis fc Andersl 2006 on 
the LMC SC system) and old GC systems is caused by dif- 
ferences in the nature and formation of the two types of 
clusters, or whether the power law of young systems is sec- 
ularly transformed into the Gaussian distribution of old GC 
systems by selective destruction effects. 



1.3 Our case study: the Antennae system 

In this paper we study the YSC system of the Antennae 
galaxies (NGC 4038/39), the best-studied and nearest ex- 
ample of a major interaction between two massive gas-rich 
spiral galaxies, at a distance of 19.2 Mpc 0. In addition, 
the low inclination of both galaxies favours the detection 
and study of their SC systems. In the Antennae galaxies, 
both the individual SCs (see e.g. IWhitmore fc Schweizerl 



1 19951 . IFritze-v. Alvenslebenl fl99sl . Il999l . IZhang et al.l l200ll . 



2 Throughout this paper we adopt a distance to the Antcn 
nae galaxies of 19.2 Mpc (m — M = 31.4 mag; see e 



IWhitmore fc Zhand |2002| . iKassin et al]|2003l . iMetz et all 12004 
We point out that adop ting the distance of 1 3.8 Mpc (m — M = 
30.7 mag) suggested by ISaviane et aL I j--'nn-l| > would result in an 
overall shift of all absolute magnitudes by 0.7 mag towards fainter 
magnitudes, leaving the arguments presented in this paper unal- 
tered. 
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IWhitmore fc Zhang||20o"3, iFall et alj|2005l; for SCs in the ex- 



tended tidal tails see 



Knierman et al. 



200311 and the SC com - 



plexes (see e.g. IWhitmore et all 120051 . iBastian et aD 120061 ) 
have been studied extensively. In addition, the Antennae 
galaxies are a testbed for a large number of studies related 
to star formation and dy namical evolution in such a com- 
plex environment (see e.g. Barned 19881 iMirabel et~al. 



Table 1. Observation log. All data obtained in Cycle 5 (in Jan- 
uary 1996), using the HST/WFPC2 camera. 



Hibbard et al.ll200ll.lFabbiano et al.ll2003l . lMetz et al 



Wang et al.ll2004 iHibbard et al.ll2005l ). 



1998 



2004 



The LF of the YSCs in the Antennae galaxies have been 
studied extensively. However, so far n o consensus about the 
shape of the LF has been reached: see iFritze 
1999) in f avour for a Gaussia n shape. IWhitmore et al 



19991 1 and iMengel et all (|2005l ) for a broken-power-law 
shape (this could be interpreted as intermediate case, with 
the shallower slope at the faint end equiv alent to the Gaus- 
sian approaching the turnover, although iGieles et alj 120061 
interpret this broken power-law as a sign of a physical up- 
per m a ss limit of star clust ers) and lWhitmore fc Schweizerl 
[|l995h , IZhang fc Falll (|l999h in favour for a power law shape. 

Establishing whether or not the LFs of SC systems are 
the same or different in quiescently star-forming galaxies 
compared to merger- induced starbursts has far-reaching im- 
plications for our understanding of the star formation and 
star cluster formation processes themselves and their pre- 
sumed universality, and possibly of their environmental de- 
pendence. To assess whether or not there is a clear turnover 
in the SC LFs (Section [4}, we introduce statistical methods 
to model the sample selection procedure (Section l4.HI , which 
includes parametric modelling of the completeness function 
(our completeness studies for the Antennae's SC system are 
summarised in Section I2.2|l and, in a second step, a maxi- 
mum likelihood procedure to estimate the parameters of a 
power-law and a Gaussian distribution, respectively, from 
the data as modified by this completeness function. Apply- 
ing a likelihood ratio test and Monte-Carlo simulations to 
test its significance we find statistically significant evidence 
for the presence of a Gaussian- like turnover in the LF at ab- 
solute magnitudes between —9.5 and —8.0 mag in the U, B, 
and V bands, and at « —6.5 mag in the I band (although 
less significant, at a 68 per cent level). These results are 
presented in detail in Section \4. 21 

To investigate the robustness of these results, in Section 
4.41 we study subsets of the full sample, divided according 
to various criteria. Although details change, the conclusions 
are confirmed by these tests. The implications of our results 
are discussed in Section [5] 



2 OBSERVATIONS AND DATA REDUCTION 

We reanalyse the most homogeneous dataset of broad- 
band imaging observations of the Antennae system avail- 
able to date, obtained using the Hubble Space Telescope 
(HST)/Wide Field and Planetary Camera-2 (WFPC2) as 
part of programme GO-5962 (PI B. Whitmore; see Table 



3 i.e. at a confidence higher than 99.5 per cent (in fact, none of 
the 1000 simulated datasets shows a superiority of the Gaussian 
over the power law as strong as in the real data) 



Filter 


Johnson equiv. passband 


total exposure time (s) 


F336W 


U 


4500 


F439W 


B 


4000 


F555W 


V 


4400 


F814W 


I 


2000 



[TJ . The images were retrieved via the on-the-fly data reduc- 
tion pipeline OPUS from the HST data archive at the Space 
Telescope European Coordinating Facility (ST-ECF). 

The image alignment and cosmic-ray rejection were 
done using standard tasks in iraf. including tasks from STS- 

The source identification was done using a version of 
DAOphot, adapted to run under idl. Since we do not a 
priori know whether or not the potentially young SCs we 
are seeking are dynamically relaxed, we relaxed the round- 
ness and sharpness criteria slightly compared to the default 
values. We compared the number of objects found using flux 
thresholds of 3<Tb g and 4<Tb g , where <Tb g represents the r.m.s. 
of the background flux, but found virtually no difference. 
We used the > 4<7b g sources, because we expected other 
completeness-limiting aspects of the data analysis to have 
more severe impacts fsee !2.2[l . Subsequently, we considered 
only sources present in either all of the UBV, or in all of 
the BVI images. This procedure prevents excluding either 
severely extincted clusters (possibly lacking [/-band imag- 
ing) or extremely young objects (possibly lacking /-band 
data) . 

In total, we identified 7817 sources with fluxes above the 
threshold value in either of our t hree adjacent pas s band sets. 
However, as already shown by IWhitmore et alj l|l999l ) the 
majority of these sources are contaminating bright stars in 
the Antennae galaxies themselves. This is also clear from the 
estimated youth of the starburst, with a strong component 
within the last few x 10 7 yr, as is app arent from the presence 
of a large number of Hn regions (e.g. iNeff fc Ulvestadll2000ll 
and the observation of Ha emission fe.g. Iwhitmore et al.l 
1 19991 . see their Fig. 4). The number of bright supergiant-type 
stars, Luminous Blue Variables, and bright O-type stars is 
difficult to estimate, but it is expected to be large, with a 
fair number reaching absolute V-band magnitudes of ~ —9 
mag, and a few rare, Eta Carinae-type objects reaching even 
Mv ~ — 10 mag. However, the further cluster selection crite- 
ria we will apply remove the vast majority of these because 
of their properties, particularly the extendedness criterion. 



4 The Image Reduction and Analysis Facility (IRAF) is dis- 
tributed by the (U.S.) National Optical Astronomy Observatories, 
which is operated by the Association of Universities for Research 
in Astronomy, Inc., under cooperative agreement with the (U.S.) 
National Science Foundation. STSDAS, the Space Telescope Sci- 
ence Data Analysis System, contains tasks complementary to the 
existing IRAF tasks. 



© 0000 RAS, MNRAS 000, 000-000 



4 P. Anders et al. 



2.1 Photometry and cluster sizes 

The crowding of the sources, their large number, partially 
their spatial extent and the strong variations in the galactic 
background contribution render photometry of the clusters 
extremely difficult. While the large number of sources re- 
quires some automation in source photometry and cluster 
candidate selection, this is severely hampered by the other 
factors indicated above. 

Therefore, we developed an automated "cookbook" to 
improve aperture photometry by taking the sizes of the 
sources into account. For a number of cluster light profiles 
and a wide range of intrinsic cluster sizes we determined the 
"observed" size (as broadened by the point-spread function 
[PSF] and the diffusion kernel of the camera; the fitted pro- 
file used was a Gaussian, for simplicity and stability reasons) 
as well as the size-dependent aperture corrections (ACs) re- 
quired to ac count for all of the cluster li ght (for a full de - 
scription see I Anders et al.ll2006l ; hereafter I AC /sizes Paper]) . 
In the present paper we will apply the algorith ms presented 
and extensively tested in the I AC /sizes Paper] to the Anten- 
nae data. 

To each of the 7817 sources we fit a Gaussian light pro- 
file, as justified and validated in the |AC /sizes Paperl . and 
obtain aperture photometry using 3-pixel source apertures, 
and 5/8-pixel inner/outer radius sky annuli. These sizes were 
chosen as a good compromise between being small enough 
to avoid crowding and the potentially detrimental effects 
caused by the highly variable background, and large enough 
to average out the Poissonian noise from the source flux and 
aperture centring effects. 

Sinc e the majority of the c lusters are younger than 25 
Myr (see IWhitmore et al.lll999l and Section |4.3|1 . we do not 
expect them to have alr eady developed tidally-truncated 
King profiles dKind Il962l ). We therefore assume the aver- 
age profile r eprese nting the YSCs in the LMC, namely an 
lElson et al] (|l987h (EFF) profile, with a power-law slope 
of —3, corresponding to an EFF15 profile in the BAOlab 
software environment. If the measured sizes in the differ- 
ent passbands differed, the luminosity-weighted mean of the 
measurements was taken as representative size. Through- 
out this paper we will use the FWHM of the EFF15 model 
as measure for the "size" of the objects, unless otherwise 
stated. 

Using the size information just obtained and the aper- 
ture photometr y of the clusters , we apply the recipes de- 
veloped in the I AC /sizes Paperl . including size-dependent 
ACs and background-oversubtraction corrections. The to- 
tal brightnesses we re corrected for Galactic extinction 
ijSchlegel et aLl figgS - ). converted from the STMAG system of 
the HST observations to the VEGAMAG systerrQ, and subse- 
quently transformed to absolute magnitudes. 

For a source to be included in our final cluster sample 
it has to fulfil the following criteria: 

5 BA Olab is a powerful image analysis suite, described in lLarsenl 
lll999h . It is especially powerful for cluster size measurements of 
marginally resolved objects, and for the creation of artificial clus- 
ter images, and thus for detailed completeness tests. 

6 The additive offsets required are solely filter-dependent, as they 
result only from the different calibration spectra. They can be 
retrieved from the authors. 



• It must have been found by the DAOPHOT-like idl rou- 
tine. 

• It must be detected at least in either all UBV or all 
BVI images, to filter out spurious detections and residual 
cosmic rays. In addition, since we want to determine phys- 
ical para meters for these clu sters using the ANALYSED al- 
gorithm (I Anders et aljloolaf ) in a subsequent paper (Paper 
II), we need multi-passband photometry, ideally for a large 
number of passbands and a wide wavelength range. 

• The size determination must have converged. We need 
to correct o ur photometry us ing the size-dependent ACs de- 
rived in the I AC /sizes Paperl . Therefore, size information is 
essential. However, the size determination becomes increas- 
ingly difficult and unreliable for faint objects, hence also 
reducing the sample's completeness (see Section [2, 2 ) . 

• The converted (= intrinsic) FWHM must be in the 
range from 0.5 to 10 WF3 pixels (i.e., pixels on the wide- 
field-3 chip of the WFPC2 camera), corresponding to a 
FWHM of 4.6 to 93 pc (or a half-light radius R 1/2 of 5.2 
to 105.1 pc, assuming an average young LMC "EFF15"- 
type cluster profile) at the adopted distance of the Anten- 
nae. This r ange covers the ra nge for which the formulae de- 
rived in the I AC /sizes Paper! can be applied confidently. For 
clusters smaller than .5 WF3 pixel th e correction equa- 
tions determined in the I AC /sizes Paperl become inaccurate. 
In addition, the lower size cut-off at 0.5 pixel (imposed to en- 
sure that the source is extended, and hence a likely cluster) 
significantly reduces the sample contamination by bright su- 
pergiants, which would appear as point-like objects: the ob- 
served size would be on the order of the filter-dependent 
PSF size (~ 1.5 — 2 pixels). The upper size limit was cho- 
sen as a compromise in view of the contamination by cluster 
complexes and the number of sources. To satisfy both con- 
straints we will, where appropriate, distinguish between SC 
samples characterised by sizes in the range from R1/2 — 5 
to 25 pc ("small" clusters) and our full sample. Note that 
cluster complexe s in M51 and the Antenna e have diameters 
of > 100 pc (see lBastian et al.ll005bl l2006h . 

• The photometric error in each passband must be ?C 0.2 
mag, because we need accurate cluster photometry to deter- 
mine the physical cluster properties within reasonable un- 
certainty ranges in Paper II. 

The final full sample contains 752 clusters satisfying all 
of these constraints, the small cluster sample contains 365 
clusters. 



2.2 Completeness determination 

In particular for a situation as complex as in the Anten- 
nae system, completeness determinations are as difficult as 
cluster selection and photometry. 

The ultimate goal for the determination of the com- 
pleteness function of a cluster sample is the determination 
of a local completeness fraction for each cluster, by taking 
into account all effects of the local environment of each clus- 
ter and all relevant selection effects. 

In reality, at least the detailed spatial dependence of the 
completeness functions can hardly ever be determined accu- 
rately for each cluster. Here, we determine the completeness 
functions for two different cluster sizes (for intrinsic sizes of 
1 and 2 pixels, chosen, respectively, to represent the size bin 
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Figure 1. The Antennae galaxies, with the two regions for which 
the completeness functions were determined indicated: on the left- 
hand side the DISC region, on the right-hand side the OVERLAP 
region. The clusters are circled. Orientation and length scales in- 
dicated in the image. 



with the largest number of clusters, and roughly approxi- 
mating the median cluster size) in two distinct regions of 
the galaxies, distinguished by their average source density. 
The location of the regions are shown in Fig. [1] We labelled 
the regions DISC region (on the left-hand side, in the disc of 
NGC 4038) and overlap region (on the right-hand side, in 
the region where the discs are overlapping, and the presence 
of large amounts of dust is seen in white) . 

For both regions, we performed a number of tests to 
estimate the completeness functions. For the DISC region we 
will present the different methods and compare their results 
(see Figs. [2] and [3]). For the most sophisticated method (tak- 
ing into account filter cross-correlation, photometric accu- 
racy and the requirement for the cluster size determination 
to converge) we will show the results for the overlap region 
as well (see Fig. [3]). The results are summarised in Table [2] 

The simplest way to estimate completeness functions is 
by distributing Gaussian-shaped artificial sources ("stars") 
with a range of brightnesses onto the image, e.g. by using 
the iraf routine mkobj with radii corresponding to the PSF 
sizes, and then to determine the fraction of sources being 
recovered by the source finding algorithm. This can be done 
for each passband separately (see Fig. [2] upper panel, thin 
solid lines with symbols). However, for source identification 
we require that a detection must be positive in at least 3 
adjacent passbands out of the 4 available. For this cross- 
correlation, a spectral energy distribution (SED) must be 
assumed. We investigated the cases of a flat SED (all colours 
are equal to 0, by definition), an SED representative of a 
young solar-metallicity cluster (age = 12 Myr, Z — 0.02 = 
Z©), and one for an intermediate-age subsolar-metallicity 
cluster (age = 100 Myr, Z = 0.008 = 0.4 Z ); see the thin 
long-dashed lines in the upper panel of Fig.[2]for the results. 

In addition, we require the clusters to have good pho- 




20 22 24 26 28 30 

magnitude 



Figure 2. Upper panel: Completeness fractions based on artifi- 
cial stars. Bottom panel: Completeness fractions based on arti- 
ficial clusters with FWHM = 1 pixel. Thin solid lines with bul- 
let points: Completeness functions for the individual filters. The 
horizontal axis is the magnitude in the respective passband. Thin 
dashed lines: Completeness functions for cross-correlating the fil- 
ters. The horizontal axis is the magnitude in the V band. Thick 
dot-dashed lines: Completeness functions for cross-correlating the 
filters and requiring good photometry (uncertainties $C 0.2 mag). 
The horizontal axis is the magnitude in the V band. XID = cross- 
correlation. Both panels relate to the DISC region. 



tometry, with photometric errors smaller than 0.2 mag in 
each passband. This requirement further decreases the com- 
pleteness fractions. The results are shown in the upper panel 
of Fig. [2] by the thick dot-dashed lines. 

However, neither the Gaussian shape nor the small size 
of the artificial stars are realistic representations of a YSC. 
We therefore built more realistic cluster models (with in- 
trinsic FWHM = 1 and 2 pixels, as explained above), using 
BAOlab, taking both the appropriate HST/ WFPC2 PSFs 
(base d on the Tiny Tim software package; see lKrist fc Hook] 
l2004h and the HST/WFPC2 diffusion kernel into account. 
We then performed the detection, cross-correlation and pho- 
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magnitude 



Figure 3. Completeness fractions based on artificial clusters, tak- 
ing the cluster sizes and size determination effects into account. 
Black lines correspond to clusters with intrinsic FWHM = 1 pixel, 
light grey lines (in colour print: green lines) to clusters with in- 
trinsic FWHM = 2 pixels. Solid thin lines with bullet points: only 
cross-correlation (here only the cross-correlations using the flat 
SED are shown, for reasons of clarity). Long-dashed thin lines: 
cross-correlation including photometric accuracy determination; 
thick dot-dashed lines: full completeness fractions including size 
determination. Upper panel: DISC region, bottom panel: OVERLAP 
region. XID = cross-correlation. 



tometry tests once again. The results are shown in the bot- 
tom panel of Fig. 

For the real clusters an additional constraint imposed 
is the size determination. Hence for each cluster the size 
determination has to converge. Since the size fitting will not 
converge easily for faint clusters, this further reduces the 
completeness fraction of the source detections. In addition, 
all methods are sensitive to the cluster size. These results are 
given in Fig. [3] which also provides the comparison between 
the completeness determined in the DISC region with that 
for the overlap region. 



Table 2. Comparison of 50 per cent completeness limits for ar- 
tificial sources of different types (stars versus clusters; 2 cluster 
sizes), different regions within the Antennae system, and different 
methods of completeness determination. XID = cross-correlation. 



model star cluster 


cluster 


cluster cluster 


DISC DISC 


DISC 


OVERLAP OVERLAP 


1 pix 


2 pix 


1 pix 2 pix 


50 per cent completeness 


limits in 


individual passbands 


U only 25.1 23.8 


23.2 


24.6 24.0 


B only 25.0 23.7 


23.1 


24.8 24.1 


V only 25.5 24.2 


23.6 


25.3 24.5 


/ only 26.1 24.6 


24.1 


25.2 24.6 


50 per cent completeness 


limits of 


cross-correlated data 


XID only 24.9 23.6 


22.9 


24.6 23.8 


phot 23.8 23.2 


22.6 


23.9 23.5 


phot + size - 22.2 


22.5 


23.6 23.5 



(,(1 




-16 -14 -12 -10 -8 -6 -16 -14 -12 -10 -8 -6 



magnitude magnitude 

Figure 4. LFs of the final cluster sample in the four passbands, 
not corrected for the effects of sample (in)completeness. The 
shaded histogram shows the full cluster sample, the open his- 
togram represents the small cluster sample. 

3 LUMINOSITY FUNCTIONS 

Fig. [4] shows the cluster LFs, i.e. the number of clusters per 
magnitude bin, for the four passbands. While the LFs ap- 
pear to exhibit a turnover, the decrease at the faint end is (at 
least partially) caused by the observational incompleteness. 
To separate the effects of (in) completeness from a possible 
intrinsic decrease in the number of faint clusters (i.e. an in- 
trinsic turnover) , we designed the statistical tools described 
in Section [4] 

3.1 Comparison with data in the literature 

The pioneering work of Whitmore and collaborators has re- 
sulted in the largest homogeneous dataset of star cluster 
photometry for the Antennae system. For a direct compari- 
son Brad Whitmore kindly provided us with the photometry 
of their full (unpublished) source sample. 
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Comparison of luminosity functions for Antennae sources 

Our data: black open + blue dashed fit; Whitmore et al 99: green filled + dark green solid fit 
Our data without ACs: magenta open + orange fit 

3 




-15 -10 -15 -10 

magnitude magnitude 

Figure 5. LFs of the final cluster sample in the four passbands, compared to the corresponding data from IWhitmore et al.l (|l999h . The 
slopes, a, of the linear fits to the logarithms of the LFs are indicated in the legends. This slope is equivalent to the LF power-law slope 
"LF- 



As can be seen from Fig. [5] our LFs are substantially 
flatter than those of the Whitmore sample. This can be par- 
tially attributed to the effect of the ACs: bright clusters in 
our sample tend to be larger (i.e. have larger ACs), which 
flattens the LF. Overall, our sample without the ACs is in 
better agreement with the Whitmore sample (despite the 
vastly different sample sizes). In addition, the different se- 
lection criteria probably also play an important role in the 
slope differences, as our selection cr iteria largely exclude th e 
stellar contamination discussed in IWhitmore et alj (| 1999T ) , 
which is expected to dominate the faint end of the LF and 
thus will lead to a steepening of the LF. 



4 TURNOVER DETERMINATION 

Particularly important for our study of the LFs is the poten- 
tial existence and the location of a turnover in our dataset to 
establish the link between our YSCs and the old GCs. Un- 
fortunately, the completeness function drops off already at 
significantly brighter magnitudes. This results in an appar- 
ent turnover of the observed cluster luminosity distribution 
for any reasonable intrinsic luminosity distribution (which 
we call the ILF), like a power-law or Gaussian distribution. 
Therefore, the process that leads to the observations, has to 
be modelled carefully before any conclusion can be drawn 
with respect to the existence of a possible turnover. 

In Section 14.11 we present a detailed statistical model 
for the observations and develop an appropriate statistical 



procedure to test for the presence of a turnover. In section 
I4.2l we apply these methods to the Antennae YSC data. 



4.1 Modelling the observations 

The observed luminosity distribution is different from the 
"true" ILF by virtue of two different stochastic mechanisms. 
First, the observations are affected by measurement errors. 
Even more important though is a so-called "missing data" 
mechanism described, in essence, by the completeness func- 
tion. Here, the probability of a cluster to be included in 
the observations Xi (i = 1, . . . ,N) depends on its bright- 
ness. The fainter the cluster, the less probable it is to be 
observed. Moreover, if the cluster is fainter than Mv ~ —5 
to —6 mag, the completeness function approaches zero, and 
it is not possible to analyse this part of the LF at all. For 
a general intr oduction to sta t istical methods with missing 
data s ee, e.g., Little & Rubin (2002), for a biometric exam- 
ple see lPatil fc Raol (|l978l ). 

We now introduce the statistical model in more detail. 
Call the (intrinsic) LF, i.e. the (probability) density of 
the brightness of a cluster, and assume that the intrinsic 
luminosities of the Antennae clusters are realisations, m;, of 
independent random variables, Mi, distributed according to 
Here, upper-case letters indicate random variables, and 
lower-case letters their realisations (i.e. the observed value). 
In the literature, essentially only log- normal ("Gaussian") 
and power-law models are under consideration for where 
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■& are the parameters of the model, to describe the LFs of 
star clusters. 

Please note: We are going to fit both the Gaussian and 
the power-law models to the distribution of cluster magni- 
tudes. Although in the literature the power-law models are 
generally fitted to the luminosity distributions, our method 
is a conservative approach, as it assigns less weight to the 
(potentially less well-defined) faint end. Hence, if we can 
show that a power law model for the cluster magnitudes 
predicts too many faint clusters, this would, a fortiori, hold 
for a power law in cluster luminosities. 

Even if the completeness function would be unity for 
all magnitudes, the measurements would not be equal to 
the m,i, but scatter around these true values due to mea- 
surement errors. We call e(-\M) the probability density of 
the measurement error of a cluster with intrinsic brightness 
M. More formally, e(-\M) is the conditional density of the 
measurement errors given that the intrinsic cluster bright- 
ness is M, i.e. the probability density that the measurement 
error has a certain value if we already know that the intrin- 
sic brightness is M. Then, the observable brightness is given 
by the random variable Yi — Mi + Ei, where the Ei are 
distributed independently of each other, as e(-|Mj). We call 
Yi the observable brightness, as opposed to the observations 
Xi, since not all of the clusters (with observable brightness 
Yi) will actually be included in an observed sample due to 
the (in)completeness effects. 

The error density e(-\M) can, for example, be a Gaus- 
sian density with fixed variance, i.e. independent of M, or a 
Gaussian with a variance which depends on M. Since the 
observed values basically result from photon count data, 
a Poisson distribution with expected value computed from 
Mi is another straightforward possibility. We use the latter 
choice. However, computations where we assumed Gaussian 
noise yield very similar results. This is because the measure- 
ment errors are small, and in particular modify the observed 
cluster distribution much less than the missing data effect 
described next. 

The probability of a cluster with observable brightness 
Y to be included in the observations is given by the com- 
pleteness function c(Y). We model this by a random variable 
Z which is Bernoulli distributed with parameter c(Y), i.e. 
Z is 1 with probability c(Y), and otherwise. 

Recapitulating, the observable brightness is modelled as 
pairs (Yi, Zi), i = 1, . . . , N of a random variable Y = M + E 
with probability densities of M, e(-\M) of E, and g$ 
of Y, and a second random variable Z, which is Bernoulli 
distributed with parameter c(Y). The observations consist 
of those pairs (Yi, Zi) where Zi = 1. This implies that the 
probability density h of an observation Xi is the joint density 
p$ of the pair (Y, Z) conditioned on the event Z = 1: 



h(x) = p$(x\z — 1) 



pg(x,z = 1) 
Po(Z = 1) : 



where P$(Z = 1) is the (marginal) probability that Z = 1. 

In the remainder of this section we describe our esti- 
mator for the best-fit values $ mod of the parameter(s) 1? 
for the Gaussian and power-law models, respectively, i.e. 
"mod" reads either as "Gaussian" or "power law". Our ap- 
proach is based on maximum likelihood estimation, which 
amounts to maximising the probability of the observed data 



(Yi,Zi = 1), i — l,...,n, given the model "model" with 
parameter(s) i.e. 

tf mod = argmax^£^(Xi, . . . , X N ), 

where 

n 

£4(X U ...,X N ) = UpT^z = 1), 

and p™ od (Y = Xi\z = 1) is the (conditional) probability of a 
cluster to have observable brightness Xi given Z = 1, i.e. to 
be actually contained in the observations. To compute this 
conditional density we first determine the marginal proba- 
bility that Z — 1 as 



oo 

T od (^ = 1) = I pT\* = l\y)gT d (y)dy 



p™ od (y\m)ftf(m)dm 



dy. 



Here we use that p™ od (z = 1 |V) = c(Y) (i.e. does not de- 
pend on the specific model at all) and 



gT d (y)= j e(y - m\m)f# oi (m)dm. 



Here, e(y— m\m) is the probability that Y = y given M — m. 
Moreover, 

mod / -i \ mod / -i | \ mod / \ 

p,, (x,z = l) =p# (z = l\x)g0 (x) 



□a 

= c(x) j e(x ~ m|m)/™ od (m)dm. 

— oo 

Therefore, the likelihood function of the observations 
Xi , . . . , Xn is 



£™° d (:ri, . . .,x N ) = YI 



c(xi) J e(xi-m\m)f^ od (m)dm 



pmod( Z=1) „ 



Maximisation of £™° d (Xi, . . . ,Xn) yields the estima- 
tors ("best-fit parameter(s)") i? mod . Moreover, we assign the 
likelihood to the respective model under consideration as a 
measure of its "probability" given the data. This can be 
used to select the best model, particularly by comparing the 
ratio of the likelihoods of the best-fit Gaussian and power- 
law model in a likelihood ratio test. Versions of this kind 
of model selection are well esta blished for a br oad range of 
statistical applications (see e.g. lLehmandll994r ). 



4.2 Statistical significance of a turnover in the 
ILF 

We now discuss the results of our application of this method 
to the Antennae clusters. For simplicity, we parametrise the 
completeness functions as Fermi functions, 



(1 + exp(a(x — 6))) 1 a, b£ 
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We emphasise that the general results do not change if we 
use, e.g., interpolation methods to describe the completeness 
function. 

For the V band, the completeness data as determined 
in Section ^. 21 (crosses') and the fitted parametrisation (solid 
line) are shown in Fig. [6] Three completeness fractions were 
determined per magnitude, corresponding to the three dif- 
ferent cluster SEDs considered, giving some uncertainty es- 
timate. The fit was performed as L\ - Fit (i.e. by minimising 
the absolute distance between model function and simulated 
completeness fractions) . 

Fig. [7] shows the best fits for the Gaussian and the 
power-law models for the different passbands. A visual in- 
spection of Fig. [7] already indicates that the intrinsic Gaus- 
sian models fit better, particularly with respect to the faint 
tail of the LF. The intrinsic power-law models fit quite well 
in all bands for bright magnitudes but assign too much 
weight (i.e. predict too many clusters) to the faint magni- 
tudes. One might suspect that this is due to the complete- 
ness curve assigning too much probability to that magni- 
tude range. However, an inspection of Fig. [6] shows that for 
— 7 ^ Mv ^ —6 mag, where the power law predicts too 
many clusters, the completeness curves estimate the gen- 
erated completeness fractions without significant bias. This 
observation can also be made for the other bands. Moreover, 
fitting a different completeness curve (e.g., a linear interpo- 
lation of the mean/median of the completeness fractions at 
each magnitude) leads to the same effects. One has to keep 
in mind that the completeness fraction plays a crucial role 
when fitting a model in this context. 

More robust conclusions can be drawn using the model 
likelihoods. To test the (null) hypothesis "true model is 
power law with the estimated best fit parameters" against 
the alternative of a Gaussian density we use a Likelihood 
Ratio test (cf. iLehmanr] 1 1994 ) . We first simulate the dis- 
tribution of the ratio of the likelihoods of a Gaussian and 
a power-law model under the hypothesis that the power- 
law model holds true. This means that we randomly draw a 
number of observations equal to the number of data point 
from the fitted power-law model (using the fitted complete- 
ness curves) and then again fit a power law and a Gaussian 
model and calculate the ratio of the corresponding likeli- 
hoods. This procedure is repeated 1000 times and the em- 
pirical quantiles of the resulting distribution are compared 
to the observed likelihood ratio of the data. Table [3] shows 
that the probability to erroneously reject the null hypothesis 
is at a level of less than 0.5 per cent for the U, B and V band, 
i.e. the result is "strongly significant" (in fact, for most cases 
considered none out of 1000 simulations, where the ILF is 
assumed be a power-law, shows a superiority of the Gaus- 
sian over the power- law as strong as in the real data). Note, 
that we estimate the probability of erroneously rejecting the 
null hypothesis from the rate of rejecting the power law in 
favour of a Gaussian in simulations with artificial data gen- 
erated from the power law model. Here, the result of zero 
rejects out of 1000 simulations implies an estimate of the 
true rate of rejection in the simulations of p ^ 0.5%, with 
an error probability of this claim to be wrong of < 1%, based 
on the characteristics of Bernoulli-distributed random vari- 
ables. For the I band, the observed value is greater than 74 
per cent of the simulated values. This means that the cor- 
responding probability of erroneous rejection (i.e. the corre- 



Table 3. Results of the Likelihood Ratio tests. For clarity the 
difference of the negative log-likelihood values is given instead of 
the likelihood ratios. Hence, negative values indicate that the null 
hypothesis (i.e. power law) is superior. The last column gives the 
probability of error for discarding an intrinsic power law (hence 
small values represent the inferiority of the power law). 





Quantiles 










Filter 


50% 


95% 


99% 


100% 


Obs. 


Prob. 


U 


-4.5 


0.1 


2.5 


5.0 


15.5 


0.000 


B 


-6.0 


-0.9 


1.3 


6.2 


32.5 


0.000 


V 


-5.4 


-0.05 


2.5 


9.4 


15.3 


0.000 


1 


-1.2 


1.8 


3.2 


5.5 


-0.4 


0.322 


FWHM: 


=0.5-2.36 pixel 


-> rjy 2 


~ 5-25 


pc 


U 


-0.9 


2.6 


5.0 


6.8 


18.5 


0.000 


B 


-1.3 


3.0 


4.6 


8.9 


35.5 


0.000 


V 


-0.5 


4.0 


5.9 


8.9 


30.8 


0.000 


1 


-0.1 


3.4 


5.2 


7.7 


16.8 


0.000 



Table 4. Best-fitting parameters for the Gaussian model, (fi = 
mean) 







Gaussian 




Filter 


M 


95 percentile range n 




U 


-9.0 


[-9.4 


-8.5] 


2.3 


B 


•8.8 


[-9.0 


-8.5] 


2.0 


V 


■8.5 


[-8.8 


-8.1] 


2.1 


I 


-6.2 


[-7.3 


-5.0] 


2.7 


FWHM 


=0.5-2.36 pixel 




5-25 pc 


U 


-8.6 


[-8.9 


-8.2] 


1.7 


B 


-8.3 


[-8.5 


-8.0] 


1.4 


V 


-8.4 


[-8.6 


-8.1] 


1.4 


I 


-8.0 


[-8.4 


-7.5] 


1.5 



sponding p-value) is 0.322 (i.e. 32.2 per cent) and the null 
hypothesis cannot be rejected at any reasonable level. 

In Table |4] we present the parameters and the uncer- 
tainty ranges from bootstrapping for the best-fitting Gaus- 
sian models. 

We conclude that the statistical methods yield impor- 
tant evidence against power-law-like distributions for our 
dataset, thus hinting at the presence of a turnover in the 
ILF at absolute magnitudes between —8.0 and —9.5 mag in 
the U, B, and V band. In the I band, the different models 
seem to fit the data with the same quality. Note that even an 
ILF that is flat to the right of the peak of its best-fit Gaus- 
sian would predict too many faint clusters, because even a 
Gaussian distribution predicts rather too many than too few 
faint clusters. However, such a model is at the borderline be- 
tween any possible presence and absence of a turnover in the 
intrinsic distribution. In particular, a rising ILF such as a 
power law is even less likely. 

4.3 Cluster ages, cluster masses and infant 
mortality 

We determined ages from our broad-band p hotometry us- 
ing the AnalySED algorithm, described in lAnders et al.l 
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OVERLAP Region 
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2 pixel clusters 




magnitude 




magnitude 



Figure 6. The completeness curve fits determined for the V band. Crosses: Data points from completeness tests. Gray solid line: Fermi 
function fit to the data points. Note that for different bands only the location on the magnitude axis varies but not the shape of the 
curve. 



(004a). Both the best ages and representative lcr uncer- 
tainty ranges are shown in Fig. [8] Combining the average 
age of our full cluster sample of «25 Myr and solar metal- 
licity with the parameters in Table |4] yields the following 
parameter estimates for the MF: log(MTo[M©]) = 4.2 and 
a m = 0.8 5 dex. Compared to th e values for the Milky Way 
(see e.g. lAshman fc ZepJ [l99sl log(M med ian[M©]) = 4.9, 
log(A/ mcan [MQ]) = 5.3, om = 0.49 dex), this turn-over ap- 
pears to be shifted to lower masses, and to be broader. Note 
that we prefer to apply our statistical treatment to the lumi- 
nosities rather than to the (physically more relevant) cluster 
masses, because the crucial completeness determination can 
be performed accurately only for the (observed) magnitudes 
and not for the (derived) masses. 

Observations of increasing numbers of interacting and 
starburst galaxies show a significantly larger number of 
young (< 10 — 30 Myr) star clusters than expected from 
a simple extrapolation of the cluster numbers at older ages, 
under the assumption of a roughly constant star cluster for- 
mation rate over the host galaxy's history, and taking into 
account the observational com pleteness limits as well as the 
effects of sample binning (see Ide Griis fc Parmentierl 120071 
for an in-depth review). The current consensus is that an 
initial fast (< 10 — 30 Myr) disruption mechanism can effec- 
tively remove up to 90 per cent of the youngest, short-lived 
clusters from a given cluster populati on. This process has 
been coined cluster "infant mortality" |Whitmorell2004l ). 

Taking the best ages in Fig. [5] at face value, the time- 
scale of infant mortality appears to be around 18-25 Myr 
(depending on the binnin g in age), in roug h agreement with 
previous claims (s ee e.g. IWhitmorel [200 1, iFall et all [20051. 
iMengel et alll2005l ; see also I Whitmore et al.ll2006l for a pre- 
sentation of earlier results), but perhaps somewhat shorter 




loglclnsler age |yr|) 

Figure 8. Age-vs.-mass relation for the full cluster sample. 
In the lower-right corner mean error bars for all clusters are 
shown. The upper error crosses give the mean uncertainties 
for clusters with ages < 10 Myr, between 10 and 100 Myr 
and > 100 Myr, respectively. The solid lines represent the fad- 
ing lines, as given by the GALEV mo dels (solar metallicity, see 
lAnders fe Fritze-v. Al vcnslcbcn 2003) and the completeness lim- 
its determined in Section l2.2l Both lines correspond to the com- 
pleteness curves for artificial clusters with 1 pixel size. Upper line: 
DISC region. Lower line: OVERLAP region. 



than theoretical predictions bv lGoodwin fc Bastianl {2006), 
possibly due to the high galactic background and/or the vi- 
olent environment. Assuming a constant cluster formation 
rate during the last < 100 Myr the infant mortality rate is 
of order 60 per cent, i.e. «40 per cent of the newly formed 
clusters survive the first ~25 Myr. 
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Figure 7. Fits of the Gaussian and the power-law model for the different passbands. The grey line is a kernel density estimate of 
the observed data and the black line shows the Maximum Likelihood fit multiplied by the weighted sum of the completeness curves 
(with weights chosen according to the relative frequencies of observations for the different regions and cluster sizes). The dashed line 
corresponds to the estimated LF without considering the completeness. The vertical scaling is in arbitrary units (a.u.), but scaled such 
that the modelled LF (i.e., the ILF convolved with the distribution of observational errors, multiplied with the completeness function) 
gives the same number of clusters as observed. 



4.4 Investigating subsets of the full cluster sample 

In this section we investigate the existence of a turnover 
in the LFs of subsets of the full cluster sample, using our 
newly developed statistics tools (Section l4.HI . We limit these 
analyses to the V-band magnitudes. 

First, we analysed the photometric data not corrected 
for size-dependent ACs, in order to exclude the possibility 
of calibration errors from the sizes/ AC algorithm. The data 
for the V band are shown in Table [5] 

In addition, we study three subsamples in cluster size 
(radius) and several subsamples in cluster age. 



Two of the three size bins were chosen to be centred 
around the cluster sizes we studied the completeness func- 
tions for previously. The third bin contains all clusters larger 
than in the first two bins. 

First, we chose three age bins roughly equally spaced in 
log(age/yr). In addition, we investigated subsamples which 
are (or are not) expected to be affected by infant mortality : 
following the theoretical work bv lGoodwin fc Bastianl (|2006t ) 
we divided the cluster sample in clusters younger and older 
than 50 Myr. As a second test, based on the observed age 
distribution of our Antennae cluster sample, we divided the 
cluster sample into clusters younger and older than 25 Myr, 
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Table 5. Results of the Likelihood Ratio tests for several sub- 
sets, performed for the V band. For clarity, the difference of the 
negative log-likelihood values is given instead of the likelihood 
ratios. Hence, negative values indicate that the null hypothesis 
(i.e. power law) is superior. The last column gives the probability 
of error for discarding an intrinsic power law (hence small values 
represent the inferiority of the power law). The italic values are 
for clusters in the size range FWHM=0.5-2.36 pixel — > ri/2 — 
5-25 pc. 

Quant iles 



Subset 


# 


50% 


99% 100% 


Obs. 


Prob. 


Without ACs 


V band 


752 


-0.8 


4.9 


7.6 


20.2 


0.000 


V band 


365 


-0.2 


5.7 


11.4 


23.1 


0.000 


Radius bins 


< 1.5 pix 


275 


-0.2 


6.2 


8.0 


27.5 


0.000 


[1.5:2.5] pix 


104 


-0.6 


2.0 


3.7 


2.6 


0.004 


> 2.5 pix 


37.3 


-8.0 


-0.4 


1.1 


24.4 


0.000 


Age bins 


< 12 Myr 


382 


-3.9 


2.4 


9.4 


4.8 


0.001 


[16:100] Myr 


328 


-2.2 


3.0 


6.9 


8.7 


0.000 


> 100 Myr 


42 


-0.6 


2.2 


3.5 


0.4 


0.118 


< 25 Myr 


541 


-5.4 


1.9 


5.7 


5.0 


0.001 


> 25 Myr 


211 


-0.8 


4.2 


7.5 


13.5 


0.000 


< 50 Myr 


627 


-5.7 


2.0 


7.5 


10.6 


0.000 


> 50 Myr 


125 


-0.4 


3.8 


6.6 


7.2 


0.000 


^ 12 Myr 


179 


-0.6 


4.2 


6.8 


9.1 


0.000 


[16:100] Myr 


163 


-0.4 


4.3 


6.2 


20.4 


0.000 


> 100 Myr 


23 


-1.6 


1.8 


3.4 


0.8 


0.054 


< 25 Myr 


254 


-0.9 


4.5 


7.7 


14.3 


0.000 


> 25 Myr 


111 


0.5 


5.0 


6.5 


20.1 


0.000 


< 50 Myr 


291 


-0.8 


4.8 


6.6 


19.8 


0.000 


> 50 Myr 


74 


-2.7 


3.4 


8.2 


6.5 


0.002 



as this seems to be the disruption time we see in our cluster 
sample. 

As can be seen from Table [5] all but one of the sub- 
sets considered confirm the findings for the full data set. 
The only exception involves clusters with ages > 100 Myr, 
but these subsets consist of 23/42 clusters, too few to base 
unambiguous statistical conclusions on. 

No significant difference is seen for the subsets during 
or after the infant mortality phase. 



4.5 Investigating the effects of artificial decrease 
of completeness 

We performed tests with the completeness functions artifi- 
cially degraded, using offsets of 0.5 mag, 1 mag and 2 mag. 
The results are shown in Tables [6] and [7] and in Fig. [9] 

As one might expect, the resulting best-fit model LFs 
become more and more skewed towards the bright end with 
increasing offsets, becoming increasingly unlike the observed 
distribution. For the full cluster sample, the power-law ILF 
becomes increasingly superior with increased offsets, as one 
might expect. For the sample containing only the "small' 
clusters, the superiority of the Gaussian is retained even 



Table 6. Results of the Likelihood Ratio tests for either the full 
sample or the subset with half-light radii between ~ 5 and 25 pc, 
for different artificial shifts in the completeness functions, per- 
formed for the V band. For clarity, the difference of the neg- 
ative log-likelihood values is given instead of the likelihood ra- 
tios. Hence, negative values indicate that the null hypothesis (i.e. 
power law) is superior. The last column gives the probability of 
error for discarding an intrinsic power law (hence small values 
represent the inferiority of the power law). These values relate to 
Fig. M 



Subset 


# 


Quantiles 
50% 99% 100% Obs. Prob. 


Shift by 0.5 mag, all clusters 


V band 


752 


-1.9 4.0 5.8 1.3 0.072 


Shift by 1.0 mag, all clusters 


V band 


752 


-1.1 3.0 6.1 -3.3 0.971 


Shift by 2.0 mag, all clusters 


V band 


752 


-3.9 0.6 16.3 -11.2 1.000 


Shift by 0.5 mag, small clusters 


V band 


365 


-0.1 5.3 7.0 17.9 0.000 


Shift by 1.0 mag, small clusters 


V band 


365 


0.2 5.5 9.8 15.6 0.000 


Shift by 2.0 mag, small clusters 


V band 


365 


1.0 6.3 8.6 21.6 0.000 



by degrading the completeness. This follows naturally from 
intrinsic selection effects in which larger clusters of a given 
brightness will more easily be missed in our sample selection 
than smaller clusters, owing to their lower surface bright- 
nesses. 

For the full cluster sample the peak of the Gaussian 
distribution shifts strongly towards fainter magnitudes with 
increasing offset, i.e. in the range where the completeness 
functions are significantly greater than both tested distri- 
butions resemble power laws. This effect is not as significant 
for the cluster sample with the narrower size range, again 
because of the intrinsic selection effects referred to in this 
respect. 



5 DISCUSSION 

5.1 Star Cluster Systems and their progenitors: 
LFs and MFs 

The mass spectra of the SC progenitors, GMCs and GMC 
cores, in nearby galaxies are found to be of power law-type, 
with a slope of — 2.0 < a < —1.5 over a wide range of cloud 



and core masses (ISolomon et al.l 19871. iKramer et al. 19981. 



Zinchenko et al.l Il998l . iRosolowskvl 120051 . iRathborne et ail 



2006), possibly with the exception of c louds in the oute r 
parts of the Milky Way and in M33 (see IRosolowskvl 120051 ) . 
which might be represented by a steeper power law. All these 
galaxies, however, are neither currently interacting nor ex- 
periencing a strong starburst. 
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Table 7. Best-fitting parameters for the Gaussian model, corre- 
sponding to Fig. [9] and Table [6] 



Filter 


Gaussian 

fjL 95 percentile range in fx 




Shift by 0.5 mag, all clusters 


V 


-6.8 [-7.5:-5.9] 


2.5 


Shift by 1.0 mag, all clusters 


V 


-4.4 [-5.6:-2.6] 


2.8 


Shift by 2.0 mag, all clusters 


V 


-2.0 [-3.7:+0.7] 


2.7 


Shift by 0.5 mag, small clusters 


V 


-7.8 [-8.1:-7.3] 


1.5 


Shift by 1.0 mag, small clusters 


V 


-7.4 [-7.8:-6.8] 


1.5 


Shift by 2.0 mag, small clusters 


V 


-7.0 [-7.5:-6.4] 


1.3 



A first attempt to assess the m olecular cloud mass spec- 
trum in the Antennae galaxies, bv lWilson et alj (|2003f ). re- 
vealed a power law with a ~ — 1.4 but remained limited to 
GMC masses above 5 x 10 6 Mq. Resolution of lower-mass 
molecular clouds, observations of molecular cloud cores and 
the determination of the cloud MF - Is it different in mas- 
sive gas-rich mergers compared to non-interacting galaxies? 
- have to await commissioning of ALMA. 

For old GC systems both LFs and MFs are of Gaus- 
sian shape: the LFs show a Gaussian turnover magnitude 
at My — —7.3 mag and a Gaussian a — 1.2 mag. As GCs 
seem to be roughly coeval, at least within the associated un- 
certainties, with ages of 10-12 Gyr, the peaked luminosity 
distribution can be converted into a mass distribution. The 
corresponding peak mass occurs at ~ 2 x 10 5 Mq. YSCs 
in the local Universe show LFs that appear generally char- 
acterised by power laws, at least towards the upper mass 
limit. It is not clear, however, whether or not the distinc- 
tion between YSCs and GCs is meaningful for all galaxies, 
and whether it reflects an intrinsic difference in the nature 
of these clusters (i.e. their formation mechanisms) or origi- 
nates from a initially universal continuous cluster distribu- 
tion re-shaped by secular evolution and destruction of the 
low er-mass S Cs. 

iLarsenl (|2004T l reports the detection of so-called su- 
per star clusters (SSCs) in a number of seemingly undis- 
turbed, quiescently star-forming face-on spirals. These SSCs 
are clearly very bright and very young; at l east some of 
them have been shown to be very massive, too (|Larsen et al.l 
120041 ). With masses around 2 x 10 5 to 1.5 x 10 6 M Q and 
small radii (half-l ight radii between 3.0 and 5.2 pc; see 
lLarsen et al.ll2004T ). they resemble young GCs, although on- 
going GC formation is not expected in these non-interacting, 
non-starbursting, just normally star-forming galaxies (see 
also Section 15. 2\ . If these massive SSCs are not very rare 
exceptions, we would have to investigate where the descen- 
dants are of all those SSCs that presumably formed earlier 



on, i.e. the intermediate-age GCs in those spirals, or whether 
we are witnessing a very special epoch in the star-formation 
history of those spirals. 

Starburst and merging galaxies are observed to harbour 
rich systems of YSCs formed during the starburst event. 
Merger remnants with post-starburst signat ures also reveal 
SC systems with ages of up to 1-3 Gyr (e.g.. Ide Griis et all 
1003d iGoudfrooii et al.l 120041 ) . indicating that at least some 
fraction of these SCs formed during the burst survived much 
longer than most of the open clusters in the Milky Way. 

The universality or non-universality (and possibly the 
environmental dependence) of the LFs of SC systems in qui- 
escently star-forming galaxies compared to merger-induced 
starbursts provides important input for models of star and 
star cluster formation. 



5.2 SC formation and star formation efficiencies 

The lifetime of a SC depends on its mass (and stellar ini- 
tial mass function), its initial degree of binding, and on its 
environment. While apparently all star formation is SC for- 
mation at the stage of embedded clusters in the Milky Way, 
most of the clusters are already unbound wh en they emerge 
from their dust cocoon lILada fc Ladal f2003). Hydrodynam- 
ical cluster formation modelling has shown that very high 
SFEs are required to form a massive cluster that is long-term 
stable. The formation of a typical GC, sufficiently massive 
and strongly bound to survive for a Hu bble time, requires 



a SFE of order 30 per cen t or higher dBrown et al. 19951 . 
lElmegreen fc Efremovll 19971 . iBastian fc Goodwinll20061 ).Te. 
at least an order of magnitude higher than SFEs in nor- 
mal spiral and irregular galaxi e s, or in dwarf galax y star- 



bursts (cf. iKrueger et al.Nl995l . iMurgia et al.l 120021 ). SFEs 
as high as 30 per cent or more, however, are observed 
in global and nuclear starbursts triggered by massive gas- 
rich mergers, such as N GC 7252, and ultra-luminous in- 
frare d galaxies (ULIRGs; |Fritze-v. Alvensleben fc Gerhard! 



Il994l . iGao fc Solomon! l2004r i and led to the idea that GCs 
might for m in these events and not only in the early 
Universe (Frit ze-v. Alvensleben fc Burkert|[l995l . ISchweizerl 
|2002| . |2003| )~ 

In addition, observations of a gap in cluste r ages but not 
in field star ages in the LMC {liich el al. 2001 '; suggest that 
SC formation there took place in stages of enhanced star 
formation only, possibly related to close encounters of the 
LMC with the Milky Way and/or the SMC. Similarly, star 
cluster formation in M51 is found to have been significantly 
enhanced d uring the last close e ncounter with its companion 
NGC 5194 llBastian et al.ll005ai) . once again when the overall 
star formation and the SFE are expected to be enhanced. 

Although little is known about molecular cloud struc- 
tures in interacting and merging galaxies, a clear difference 
does exist in the molecular gas content between normal 
star-forming galaxies and ULIRGs. We know that while the 
CO(1-0) line traces molecular gas at densities n ^ 100cm -3 , 
the HCN(l-O) and CS (1-0) lines trace gas at densities 
n 10 4 cm~ 3 and n ^ 10 5 cm -3 , respectively. 

Submillimetre observations show that for quiescently 
star- forming galaxies only a small fraction (~ 0.1 — 3 per 
cent) of all their (CO) molecular gas is at the high densi- 
ties of molecular cloud cores, as traced by HCN or CS, i.e. 
L(HCN)/L(CO) ~ 0.001 - 0.03. In contrast, ULIRGs show 
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that, averaged over volumes of (10 — 300 pc) 3 , the ratio cently verified ob servation ally for the Antennae g alaxies by 
M(H CN)/M(CO) can reach up to 0.3 - 1 (|Gao fe Solomonl 
2004), i.e. the molecular cloud structure must be very dif- 



ferent from the case of quiescently star-forming galaxies - 
to the point that it becomes very difficult to imagine much 
internal structure at all, if essentially all the molecular gas 
is at molecular cloud core densities. 

The mass ratio of high-density gas to the total gas 
mass (as traced by the L(HCN)/L(CO) ratio) in ULIRGs 
is higher by a factor of ~ 5—10 when compared with nor- 
mal star-forming galaxies (iGao fc SolomorJ [2004). Since the 
mass in high-density gas correlates linearly with the star- 
formation rate (SFR: as measured by the far-infrared flux; 
IGao fc Solomonll2004l ). ULIRGs therefore also possess a star 
formation per unit total gas mass higher by a factor ~ 5 — 10 
compare d to normal S F galaxies. 

The lSc"hmidtll|l959l) law relating the surface densities of 
the SFR and of hydrogen gas (Hi from direct observations 
and H2 as traced by CO) by Esfr oc £™ as with n ~ 1.4 
is valid from quiescently star-forming spirals all the way 
through to ULIRGs, probab ly the most active ly star-forming 
galaxies in existence today l|Kennicutd ll998) . over almost 5 
orders of magnitude in gas surface density and 6 orders of 
magnitude in SFR density. However, if this relation is ex- 
pressed in terms of high-density g traced by HCN or 
CS, instead of the low-density gas traced by CO, it becomes 
linear, indicating that star formation is more fundamentally 
governed by th e content of high density gas, not the overa ll 
gas content fsee lSolomon et al]|l992 . lGao fc Solomonll2004h . 
The non-linearity in the iKennicutt ( 19981 ) law might then 
arise from an environment-dependent time scale and/or ef- 
ficiency to transform Hi into H2 and the low-density gas 
into high-density gas. This is an important issue to consider 
in hydrodynamical modelling of galaxies and galaxy merg- 
ers, which then needs to account for a multi-phase ISM and 
include a careful description of phase transitions, star for- 
mation and feedback processes. 

Star formation in normal galaxies, spirals and irregu- 
lars, is thought to occur through the collapse of molecular 
clouds, whereby the mass spectrum apparently remains self- 
similar from molecular clouds through molecular cloud cores 
all the way to the mass spectrum of open star clusters, all of 
which ar e power laws with m ~ —1.5 to —2 (|Lada fc Ladal 
120031 . see lElmegreen fc Efremovlll997l for a theoretical foun- 
dation), implying a constant SFE. However, observations 
show considerabl e SFE scatter between different clouds and 
clust ers (see e.g. lLada et al.lll997l . lKodll999l . lLada fc Ladal 
2003), although this quantity is naturally very difficult to 
measure. As the SFE is linked strongly no n-linearly with the 
surviving bound fracti o n of the cluster ( Lada et "all Il984l. 
iGever fc ; Burkerd l200ll . iFellhauer fc Kroupal |2005I l~ devia- 
tions from this self-similarity are not coming as a surprise. 
Their impact on t he form of the cluster LF (and MF) is stud- 
ied extensively in iParmentier fc Gilmore] (|2006l ). where it is 
shown to explain the occurrence of a turnover in the cluster 
LF derived from a power-law cloud LF (or MF). However, 
this whole issue is currently under vigorous debate (see also 
below). 

In interacting galaxies, the frequency of molecular cloud 
collisions is expected to increase strongly and this will 
considerably enhance star formation. Moreover, molecu- 
lar clouds get shock-compressed by external pressure (re- 



cently verm cd observationall y tor the A ntennae galaxies by 
lHaas et alj|2005l . although Iwhitmore et""al"1l2005l report the 
absence of shock- heated gas close to their sample of YSCs), 
grow denser and more massi ve, and this process can drive 
up the SFE very efficiently jjjog fc Solomonl 1 19921 . iBarnesI 
l2004l) . |Jog fc Das! i| 19921 . 11999 ) have shown that a relatively 
small increase in the external ambient pressure to values 3 
- 4 times the internal pressure within the molecular clouds 
in the undisturbed galaxy can drive SFEs up to 70 - 90 per 
cent. 



5.3 Evolution of SC systems 

Based on the observed power-l aw LFs of YSCs by 
van den Bergh fc Lafontamel (1984) for the Milky Way and 
Hunter et al l (120031 ) for the LMC, a power-law LF is usu- 



ally als o adopted for YSC systems in interacting galax- 
ies (see |de Griis et al.l l003bl for a recent compilation; also 
IZhang et al.ll200ll . IZepf et al.lll999l . lde Griis et al.ll200ll ). al- 
though in most cases these LFs cannot be traced to be- 
low the expected turnover magnitude, if there were one. In 
addition, in such violently star-forming systems the stellar 
contamination of the sample is more difficult to deal with, 
and the determination of observational completeness frac- 
tions is difficult because of strongly variable backgrounds. 
Non-negligible age differences among the YSCs lead to dis- 
tortio ns of the LFs with respect to their underly ing, intrinsic 
MFs jMeuredll995l . lFritze-v. Alvenslebenlll999l ). 

Approaching this issue from the opposite point of 
view, it is usually argued that the underlying MF is 
most likely a power-law, based on obser vations of the 
CMC MFs in nearby normal galaxi es (e.g. , Solomo n et al.l 
I l987l. lElmegreen fc Efremovl 1l997l iKramer et all 1 19981. 



I Zinchenko et all Il998l . iRosolowskvl 120051 . iRathborne et all 
120061 ). This argument has two drawbacks, however: 

(i) It assumes that the cluster mass correlates directly 
with the mass of the precursor GMC, and hence must be 
subject to a constant SFE. As indirect measure, a corre- 
lation between mass and radius would then be expected. 
The presence of a mass-radius relation fo r GMCs and its 
absence for clusters l|Ashman fc Zepj|200ll) . however, casts 
doubt on the validity of this assumption. Present studies on 
these mass-radius relations are limited to nearby and there- 
fore only relatively undisturbed galaxies, however. The im- 
pact of the observed cloud-cloud scat ter in SFEs on the clus- 
ter LF /MF was recently studied by IParmentier fc Gilmord 
(2006). Here, it was shown that this scatter can transform a 
power-law MF of GMCs into a bell-shaped star cluster MF. 

(ii) It assumes that the GMC MFs in interacting galaxies 
are similar to t hose in nearby normal galaxies. However, as 
recently shown (|Gao fc S olomon 2004), of relevance for star 
formation is not the mass of the GMCs, but the mass of the 
dense cores embedded within these GMCs. The fraction of 
gas in these cores compared to the total gas mass is substan- 
tially higher in violently interacting and star-forming galax- 
ies, compared to quies cently star-forming isolated galaxies 
l|Gao fc Solomonl |2004| ). The MFs of both GMCs and of 
their cores in the nearest interacting and starburst galaxies, 
e.g. such as in the Antennae galaxies, will be determined 
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only to sufficiently low massed when the next generation 
(sub) millimetre observatory ALMA becomes operational in 
the next decade. Until then, the initial cluster MF will likely 
be the best proxy, but see (i) above. 

It has hitherto remained unclear whether the difference 
in shape between the powe r -law LFs of young SCs (bu t see 
Fritze-v. Alvenslebenl Il998l . 1 19991 . Ide Griis et all l003d and 



Goudfrooii et all 120041 for the Gaussian LFs in the An- 



tennae galaxies, M82B and NG C 1316, and the new re- 
sults bv lde Griis fc Andersl koOfj on the LMC SC system) 
and old GC systems is caused by differences in the nature 
and formation of the two types of clusters, or whether the 
power law of young systems is secularly transformed into 
the Gaussian distribution of old GC systems by selective 
destruction effects. Models for the evolution of SC systems 
in galactic potentials have naturally obtained a Gaussian 
shape from an ini tial power law by selectively destroying 
low-mass clusters (jFall fc Zhang 2001). Detailed dynamical 
studies that also include dynamical friction as an impor- 
tant disruption mechanism for massive clusters (|Vesperinil 
2000) show that although it is possible to obtain a final 
Gaussian distribution from an initial power-law, significant 
fine-tuning of the model parameters is required to obtain the 
observed GC mass function. In contrast, an initial Gaussian 
shape similar to that observed today is conserved for a wide 
range of assumptions despite the disruption of more than 
50 per cent of the original cluster population. In addition, 
an initial Gaussian shape with parameters differing from 
the presently observed ones tend to evolve to a s hape with 
the p resently observed parameters at GC ages (|Vesperinil 
|2000| ). It is not yet possible to do this kind of modelling 
in the time-varying ga l actic potent ials of merging galaxies. 
IParmentier fc Gilmorel <|2005l . l2006l ) have shown that the ini- 
tial cluster MF of the Galactic GC system very probably was 
already either Gaussian in shape or described by a truncated 
power law with a cut-off at the same p osition as the turnove r 
of the Gaussian distribution. Similarlv. lde Griis et al.l (|2005l ) 
show that the initial clus ter MF in the post- starburst re- 
gion B in M82 (shown in Ide Griis et alj l003d to possess a 
present-day cluster MF of roughly Gaussian shape) was also 
most likely of Gaussian shape, supported by plausibility ar- 
guments regarding the initial stellar density in this region. 



5.4 Our results in this general framework 

Our findings represent the youngest star cluster system for 
which a Gaussian-shaped LF fits statistically significantly 
better than a power-law distribution. T his is an exten- 
sion o f the results found observationally bv lGoudfrooii et all 
(2004) (~ 3 Gyr-old c luster system i n the merger rem- 
nant NGC 1316) and Ide Griis et all l|003ch (~ 1 Gyr- 
old cluster system in the fossil starburst region B in 
M82). It suppo rts the detailed models by Vesp erini (see 
IVesperinil 2000 for a Gaussian initial LF and IVesperinU 



7 Following IWilson et all ||2003| 1 and the ALMA sensitivity 
calculator 

(http:/ /www. eso.org/projects/alma/science/bin/sensitivity.html), 
we roughly estimate limiting GMC masses to be observed in 
the Antennae galaxies by ALMA around 6 X 10 4 Mq (1 hr 
observation) to 1 X 10 4 Mq (30 hr observation). 



2001 for the power-law cas e) . ([Parmentier fc Gilmorei r2005. 
200611 and Ide Griis et"afl (120051 ). In the framework of 
IParmentier fc Gilmorel (120061 ) the occurrence of the turnover 
at such early times is a natural outcome of shaping the LF 
during the earliest stages of cluster evolution, namely the 
phase of the rem oval of gas left over from c luster formation. 
IVesperinil (|2000h and Ide Griis et~aH (|2005l ) show the neces- 
sity for an init ial or very early Gauss i an LF , which is in 
agreement with IParmentier fc Gilmorel (|2006l ). 

The proposed universality of the Gaussian shape of SC 
LFs, although not necessarily their (possibly environment- 
dependent) parameters, has far-reaching consequences for 
studies of star and star cluster formation. It opens the pos- 
sibility to study the formation of massive bound clusters 
(proto-GCs) at close reach, a process that was previously 
thought to have taken place exclusively in the early Uni- 



6 SUMMARY 

In this paper we have studied the luminosity functions of the 
YSCs in the Antennae galaxies (NGC 4038/39), the nearest 
ongoing merger of two spiral galaxies. The merging process 
is accompanied by a strong burst of star formation, and par- 
ticularly star cluster formation. We carefully select a sample 
of clearly extended sources, and hence remove the otherwise 
strong contamination by bright stars within the Antennae 
galaxies. 

The complex background, caused by the interaction and 
the contamination by bright stars, hampers the accurate 
determination of the observational completeness. However, 
an accurately determined completeness function is essential 
to determine whether or not the intrinsic LF of the cluster 
system shows a turnover like the old GC systems. We present 
a number of tests on the completeness functions, and showed 
that it is essential to include all cluster selection criteria in 
the determination of the completeness function. 

We attempted to fit the intrinsic LFs with the most 
widely used models, i.e. a power-law distribution and a 
Gaussian distribution, taking the completeness function and 
the photometric errors into account. We find statistically sig- 
nificant evidence that the LFs of this sample of clearly ex- 
tended sources are best described by an intrinsic Gaussian 
distribution. We determine the chance of error (i.e. the prob- 
ability that we erroneously discard a better-fitting power-law 
distribution) using Monte Carlo-simulations, and find it to 
be below 0.5 per cent. 

Earlier claims of power-law LFs suffer most likely from 
the strong stellar contamination and the difficulties in the 
completeness determination. In addition, the statistical tools 
developed for this study are beyond the sensitivity of any 
commonly used tools. 
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